Events are Not Simple: Identity, Non-Identity, and Quasi-Identity

نویسندگان

  • Eduard Hovy
  • Teruko Mitamura
  • Felisa Verdejo
  • Jun Araki
  • Andrew Philpot
چکیده

Despite considerable theoretical and computational work on coreference, deciding when two entities or events are identical is very difficult. In a project to build corpora containing coreference links between events, we have identified three levels of event identity (full, partial, and none). Event coreference annotation on two corpora was performed to validate the findings. 1 The Problem of Identity Last year we had HLT in Montreal, and this year we did it in Atlanta. Does the “did it” refer to the same conference or a different one? The two conferences are not identical, of course, but they are also not totally unrelated—else the “did it” would not be interpretable. When creating text, we treat instances of entities and events as if they are fixed, well-described, and well-understood. When we say “that boat over there” or “Mary’s wedding next month”, we assume the reader creates a mental representation of the referent, and we proceed to refer to it without further thought. However, as has been often noted in theoretical studies of semantics, this assumption is very problematic (Mill, 1872; Frege 1892; Guarino, 1999). Entities and (even more so) events are complex composite phenomena in the world, and they undergo change. 1 This work was supported by grants from DARPA and NSF, as well as by funding that supported Prof. M. Felisa Vedejo from UNED Madrid. Since nobody has complete knowledge, the author’s mental image of the entity or event in question might differ from the reader’s, and from the truth. Specifically, the properties the author assumes for the event or entity might not be the ones the reader assumes. This difference has deep consequences for the treatment of the semantic meaning of a text. In particular, it fundamentally affects how one must perform coreference among entities or events. As discussed in Section 6, events have been the focus of study in both Linguistics and NLP (Chen and Ji, 2009; Bejan and Harabagiu, 2008, 2010; Humphreys et al., 1997). Determining when two event mentions in text corefer is, however, an unsolved problem. Past work in NLP has avoided some of the more complex problems by considering only certain types of coreference, or by simply ignoring the major problems. The results have been partial, or inconsistent, annotations. In this paper we describe our approach to the problem of coreference among events. In order to build a corpus containing event coreference links that is annotated with high enough inter-annotator agreement to be useful for machine learning, it has proven necessary to create a model of event identity that is more elaborate than is usually assumed in the NLP literature, and to formulate quite specific definitions for its central concepts. 2 In this work, we mean both events and states when we say “event”. A state refers to a fixed, or regularly changing, configuration of entities in the world, such as “it is hot” or “he is running”. An event occurs when there is a change of state in the world, such as “he stops running” or “the plane took off”. Event coreference is the problem of determining when two mentions in a text refer to the ‘same’ event. Whether or not the event actually occurred in reality is a separate issue; a text can describe people flying around on dragons or broomsticks. While the events might be actual occurrences, hypothesized or desired ones, etc., they exist in the text as Discourse Elements (DEs), and this is what we consider in this work. Each DE is referred to (explicitly or implicitly) in the text by a mention, for example “destroy”, “the attack”, “that event”, or “it”. But it is often unclear whether two mentions refer to the same DE or to closely related ones, or to something altogether different. The following example illustrates two principal problems of event coreference: While Turkish troops have been fighting_E.1 a Kurdish faction in northern Iraq, two other Kurdish groups have been battling_E.2 each other. A radio station operated_E.3 by the Kurdistan Democratic Party said_E.4 the party's forces attacked_E.5 positions of the Patriotic Union of Kurdistan on Monday in the Kurdish region's capital Irbil. The Voice of Iraqi Kurdistan radio, monitored_E.6 by the British Broadcasting Corp., said_E.7 more than 80 Patriotic Union fighters were killed_E.8 and at least 150 wounded_E.9. The fighting_E.10 was also reported_E.11 by a senior Patriotic Union official, Kusret Rasul Ali, who said_E.12 PUK forces repelled_E.13 a large KDP attack_E.14.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Identity of Moses in Surah Al-Qasas with Reference to Time and Space

The question of identity in a narrative text is one of the most influential questions that need further study. The variations in the factors that may affect the concept of identity add to the complexity of the narrative text. The study aims at analyzing the main phases, stages, themes and events of Moses’ life story as part of the narrative discourse. The effects of time and place on the main e...

متن کامل

بررسی نقش معنویت، صفات شخصیتی (پنج بزرگ)، سبک‌های هویتی، و تاب‌آوری در پیش‌بینی عضویت در گروه‌ها ی معتاد و غیرمعتاد

  B ackground and Objectives: Meny studies have shown that addicts and non-addicts are different in personality traits, identity styles, resiliency and spirituality, but the role of each component in predicting addictive behavior and discriminating between addict and non-addict groups is not clear. So, the purpose of the current research was to identify the role of predictive factors according ...

متن کامل

Forming a unique identity derived from cultural values hidden in the collective memories of citizens

BACKGROUND AND OBJECTIVES: The city is not just about appearance. People are present and live with architectural and urban spaces. The changes of Bojnord city after becoming the provincial capital have had many and sometimes destructive effects on the city. The issue of bad identity of cities is not a new one, but how to deal with it, despite the history of the problem, is stil...

متن کامل

Analysis of national identity in textbooks

Today, national identity is one of the most important issues in developing countries is planned. Much of the training of national identity in the context of formal education takes place .Beginning with the period of fundamental transformation textbooks is also consistent with the new approach. The present article deals with the analysis of national education and from this point of clarification...

متن کامل

On the Link between Identity Processing and Learning Styles among Young Language learners

The present study attempted to investigate the probable relationship between Iranian young language learners’ identity processing styles and their learning styles. To this end, 29 advanced learners, 23 females and 6 males were randomly selected from an English language Institute. Twenty nine advanced young language learners were chosen randomly out of whole advanced young language learners in t...

متن کامل

The Role of Perceived Gender Discrimination and Identity Styles in Predicting Learned Helplessness in Girls Experiencing Home Running

The aim of this study was to investigate the role of perceived gender discrimination and identity styles in predicting learned helplessness in girls experiencing home run. This research was descriptive and correlational. The statistical population of the study was girls referring to night care centers and shelters in Tehran, District 12 (Shoosh neighborhood) in the first quarter of 1400, from w...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013